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Abstract. We use the incompressibility method based on Kolmogorov complexity to determine 
the total number of bits of routing information for almost all network topologies. In most models 
for routing, for almost all labeled graphs 0(n 2 ) bits are necessary and sufficient for shortest path 
routing. By 'almost all graphs' we mean the Kolmogorov random graphs which constitute a fraction 
of 1 — l/n c of all graphs on n nodes, where c > is an arbitrary fixed constant. There is a model for 
which the average case lower bound rises to fl(n 2 logn) and another model where the average case 
upper bound drops to 0(n log 2 n). This clearly exposes the sensitivity of such bounds to the model 
under consideration. If paths have to be short, but need not be shortest (if the stretch factor may 
be larger than 1), then much less space is needed on average, even in the more demanding models. 
Full-information routing requires 0(n 3 ) bits on average. For worst-case static networks we prove a 
H(n 2 logn) lower bound for shortest path routing and all stretch factors < 2 in some networks where 
free relabeling is not allowed. 
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1. Introduction. In very large communication networks like the global tele- 
phone network or the internet connecting the worlds computers, the message volume 
being routed creates bottlenecks degrading performance. We analyze a tiny part of 
this issue by determining the optimal space to represent routing schemes in commu- 
nication networks for allmost all static network topologies. The results give also the 
average space cost over all network topologies. 

A universal routing strategy for static communication networks will, for every 
network, generate a routing scheme for that particular network. Such a routing scheme 
comprises a local routing function for every node in this network. The routing function 
of node u returns for every destination w^tian edge incident to u on a path from u to 
v. This way, a routing scheme describes a path, called a route, between every pair of 
nodes u, v in the network. The stretch factor of a routing scheme equals the maximum 
ratio between the length of a route it produces, and the shortest path between the 
endpoints of that route. 

It is easy to see that we can do shortest path routing by entering a routing table 
in each node u which for each destination node v indicates to what adjacent node w 
a message to v should be routed first. If u has degree d, it requires a table of at most 
n log d bitsfj] and the overall number of bits in all local routing tables never exceeds 
n 2 logn. 

The stretch factor of a routing strategy equals the maximal stretch factor attained 
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by any of the routing schemes it generates. If the stretch factor of a routing strategy 
equals 1 , it is called a shortest path routing strategy because then it generates for every 
graph a routing scheme that will route a message between arbitrary u and v over a 
shortest path between u and v. 

In a full information shortest path routing scheme, the routing function in u 
must, for each destination v return all edges incident to u on shortest paths from u to 
v. These schemes allow alternative, shortest, paths to be taken whenever an outgoing 
link is down. 

We consider point to point communication networks on n nodes described by 
an undirected graph G. The nodes of the graph initially have unique labels taken 
from a set {1, . . . , to} for some m > n. Edges incident to a node v with degree d(v) 
are connected to ports, with fixed labels 1, . . . , d(v), by a so called port assignment. 
This labeling corresponds to the minimal local knowledge a node needs to route: a) 
a unique identity to determine whether it is the destination of an incoming message, 
b) the guarantee that each of its neighbours can be reached over a link connected to 
exactly one of its ports, and c) that it can distinguish these ports. 

1.1. Cost Measures for Routing Tables. The space requirements of a routing 
scheme is measured as the sum over all nodes of the number of bits needed on each 
node to encode its routing function. If the nodes are not labeled with {1, . . . , n} — the 
minimal set of labels — we have to add to the space requirement, for each node, the 
number of bits needed to encode its label. Otherwise, the bits needed to represent 
the routing function could be appended to the original identity yielding a large label 
that is not charged for but does contain all necessary information to route. 

The cost of representing a routing function at a particular node depends on the 
amount of (uncharged) information initially there. Moreover, if we are allowed to 
relabel the graph and change its port assignment before generating a routing scheme 
for it, the resulting routing functions may be simpler and easier to encode. On a 
chain, for example, the routing function is much less complicated if we can relabel 
the graph and number the nodes in increasing order along the chain. We list these 
assumptions below, and argue that each of them is reasonable for certain systems. 
We start with the three options for the amount of information initially available at a 
node. 

I Nodes do not initially know the labels of their neighbours, and use ports to 
distinguish the incident edges. This models the basic system without prior 
knowledge. 

IA The assignment of ports to edges is fixed and cannot be altered. This 
assumption is reasonable for systems running several jobs where the 
optimal port assignment for routing may actually be bad for those other 
jobs. 

IB The assignment of ports to edges is free and can be altered before com- 
puting the routing scheme (as long as neighbouring nodes remain neigh- 
bours after re-assignment). Port re-assignment is justifiable as a local 
action that usually can be performed without informing other nodes. 
II Nodes know the labels of their neighbours, and know over which edge to 
reach them. This information is for free. Or, to put it another way, an in- 
cident edge carries the same label as the node it connects to. This model is 
concerned only with the additional cost of routing messages beyond the im- 
mediate neighbours, and applies to systems where the neighbours are already 
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known for various other reasonsg. 
Orthogonal to that, the following three options regarding the labels of the nodes are 
distinguished. 

a Nodes cannot be relabeled. For large scale distributed systems relabeling 

requires global coordination that may be undesirable or simply impossible. 
(3 Nodes may be relabeled before computing the routing scheme, but the range 
of the labels must remain 1, . . . , n. This model allows a bad distributions of 
labels to be avoided. 
7 Nodes may be given arbitrary labels before computing the routing scheme, 
but the number of bits used to store its label is added to the space require- 
ments of a node. Destinations are given using the new, complex, labels^]. 
This model allows us to store additional routing information, e.g. topological 
information, in the label of a node. This sort of network may be appropriate 
for centrally designed interconnect networks for multiprocessors and commu- 
nication networks. A common example architecture of this type is the binary 
n-cube network where the 2™ nodes are labeled with elements of {0, 1}" such 
that there is an edge between each pair of nodes iff their labels differ in ex- 
actly one bit position. In this case one can shortest path route using only the 
labels by successively traversing edges corresponding to flipping successive 
bits in the positions where source node and destination node differ. 
These two orthogonal sets of assumptions IA, IB, or II, and a, /3, or 7, define the nine 
different models we will consider in this paper. We remark that the lower bounds for 
models without relabeling are less surprising and less hard to prove than the bounds 
for the other models. 

1.2. Outline. We determine the optimum space used to represent shortest path 
routing schemes on almost all labeled graphs, namely the Kolmogorov random graphs 
with randomness deficiency at most clogn which constitute a fraction of at least 
I — l/n c of all graphs, for every fixed constant c > 0. These bounds straightforwardly 
imply the same bounds for the average case over all graphs provided we choose c > 3. 
For an overview of the results, refer to Table [IQ. 

We prove that for almost all graphs f2(n 2 ) bits are necessary to represent the 
routing scheme, if relabeling is not allowed and nodes know their neighbours (II A 
a) or nodes do not know their neighbours (IA V IB)[]. Partially matching this lower 
bound, we show that 0(n 2 ) bits are sufficient to represent the routing scheme, if the 
port assignment may be changed or if nodes do know their neighbours (IB V II). In 



2 We do not consider models that give neighbours for free and, at the same time, allow free 
port assignment. For, given a labeling of the edges by the nodes they connect to, the actual port 
assignment doesn't matter at all, and can in fact be used to represent d(v) log d(v) bits of the 
routing function. Namely, each assignment of ports corresponds to a permutation of the ranks of the 
neighbours — the neighbours at port i moves to position i. There are d(v)\ such permutations. 

3 In this model it is assumed that a routing function cannot tell valid from invalid labels, and 
that a routing function always receives a valid destination label as input. Requiring otherwise makes 
the problem harder. 

4 In this table, arrows indicate that the bound for that particular model follows from the bound 
found by tracing the arrow. In particular, the average case lower bound for model IA A /3 is the 
same as the IA A 7 bound found by tracing — >. The reader may have guessed that a ? marks an 
open question 

5 We write A V B to indicate that the results hold under model A or model B. Similarly, we 
write A A B to indicate the result holds only if the conditions of both model A and model B hold 
simultaneously. If only one of the two 'dimensions' is mentioned, the other may be taken arbitrary 
(i.e., IA is a shorthand for (IA A a) V (IA A j9) V (IA A 7)). 
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no relabeling 
(a) 



permutation 



free relabeling 
(7) 



worst case — lower bounds 
port assignment free (IB) 
neighbours known (II) 



(ra 2 /9)logra [Thm 4.4 1 



Q(n 2 log n 
U(n 2 ) ' 



n 2 /32 [ThmPI 
Q(n 7 / 6 ) jj^ 



average case 



upper bounds 



port assignment fixed (IA) 
port assignment free (IB) 
neighbours known (II) 
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3n 2 
3n 2 



[Thm 
[Thm 
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6nlog 2 n [Thm 3.2 



average case — lower bounds 
port assignment fixed (IA) (n 2 /2)logn [T 
port assignment free (IB) n 2 /2 [Thm 

neuilibours known (II) n 2 /2 [Thm 



1.1] 



1.3 



n 2 /32 [Thm L2\ 
n 2 /32 [ThmjO 
? 



Table 1 

Size of shortest path routing schemes: overview of results. The results presented in this paper are 
quoted with exact constants and asymptotically (with the lower order of magnitude terms suppressed). 



contrast, for almost all graphs, the lower bound rises to asymptotically n 2 /2logn bits 
if both relabeling and changing the port assignment are not allowed (I A A a), and 
this number of bits is also sufficient for almost all graphs. And, again for almost all 
graphs, the upper bound drops to (9(nlog 2 n) bits if nodes know the labels of their 
neighbours and nodes may be arbitrarily relabeled (II A 7). 

Full information shortest path routing schemes are shown to require, on almost 
all graphs, asymptotically n 3 /4 bits to be stored, if relabeling is not allowed (a), and 
this number of bits is also shown to be sufficient for almost all graphs. (The obvious 
upper bound for all graphs is n 3 bits.) 

For stretch factors larger than 1 we obtain the following results. When nodes 
know their neighbours (II), for almost all graphs, routing schemes achieving stretch 
factors s with 1 < s < 2 can be stored using a total of 0{n log n) bits^]. Similarly, for 
almost all graphs in the same models (II), 0(n log log n) bits are sufficient for routing 
with stretch factor > 2. Finally, for stretch factors > 61ogn on almost all graphs 
again in the same model (II), the routing scheme occupies only 0(n) bits. 

For worst case static networks we prove, by construction of explicit graphs, a 
il(n 2 log n) lower bound on the total size of any routing scheme with stretch factor 
< 2, if nodes may not be relabeled (a). 

The novel incompressibility technique based on Kolmogorov complexity, |J, has 
already been applied in many areas but not so much in a distributed setting. A 
methodological contribution of this paper is to show how to apply the incompressibility 
method to obtain results in distributed computing for almost all objects concerned, 
rather than for the worst-case object. This hinges on our use of Kolmogorov random 
graphs in a fixed family of graphs. Our results hold also averaged over all objects 
concerned. 

Independent recent work || [?J applies Kolmogorov complexity to obtain related 
worst-case results mentioned in next section. They show for example that for each 
n there exist graphs on n nodes which may not be relabeled (a), that require in the 
worst case f2(n 3 ) bits to store a full information shortest path routing scheme. We 
prove for the same model that for almost all graphs full information routing n 3 /4 bits 
in total is necessary and sufficient (asymptotically). 



6 For Kolmogorov random graphs which have diameter 2 by Lemma p.q routing schemes with 
s = 1.5 are the only ones possible in this range. 
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1.3. Related Work. Previous upper- and lower bounds on the total number of 
bits necessary and sufficient to store the routing scheme in worst-case static commu- 
nication networks are due to Peleg and Upfal jl(J , and Fraigniaud and Gavoille Q . 

In [jlO| it was shown that for any stretch factor s > 1, the total number of bits 
required to store the routing scheme for some n-node graph is at least Q.(n 1+1 ^ 2s+A ^) 
and that there exist routing schemes for all n-node graphs, with stretch factor s = 
12fc + 3, using 0(k 3 n 1+1 / k logn) bits in total. For example, with stretch factor s — 15 
we have k — 1 and their method guarantees 0(n 2 log n) bits to store the routing 
scheme. The lower bound is shown in the model where nodes may be arbitrarily 
relabeled and where nodes know their neighbours (II A 7). Free port-assignment in 
conjunction with a model where the neighbours are known (II) can, however, not be 
allowed. Otherwise, each node would gain nlogn bits to store the routing function in 
(see the footnote to model II). 

Fraigniaud and Gavoille Q showed that for stretch factors s < 2 there are routing 
schemes that require a total of f2(n 2 ) bits to be stored in the worst case if nodes may 
be relabeled by permutation (/3). This was improved for shortest path routing by 
Gavoille and Perennes || who showed that for each d < n there are shortest path 
routing schemes that require a total of il(n 2 \ogd) bits to be stored in the worst case 
for some graphs with maximal degree d, if nodes may be relabeled by permutation and 
the port-assignment may be changed (IB A (3). This last result is clearly optimal for 
the worst case, both for general networks (d = Q(n)) and bounded degree networks 
(d < n). In Q it was shown that for each d > 3 there are networks for which any 
routing scheme with stretch factor < 2 requires a total of £l(n 2 / log 2 n) bits. 

Interval routing on a graph G — (V,E), V = {1, ...,11} is a routing strategy 
where for each node i, for each incident edge e of i, a (possibly empty) set of pairs 
of node labels represent disjoint intervals with wrap-around. Each pair indicates the 
initial edge on a shortest path from i to any node in the interval, and for each node 
j 7^ i there is such a pair. We are allowed to permute the labels of graph G to optimize 
the interval setting. 

|^| shows that there exist graphs for each bounded degree d > 3 such that for 
each interval routing scheme, each of f2(n) edges are labeled by Q(n) intervals. This 
shows that interval routing can be worse than straightforward coding of routing tables 
which can be trivially done in 0(n 2 logd) bits total. (This improves showing that 
there exist graphs such that for each interval routing scheme some incident edge on 
each of il(n) nodes is labeled by £l(n) intervals, and for each d > 3 there are graphs 
of maximal node degree d such that for each interval routing scheme some incident 
edge on each of f2(n) nodes is labeled by f2(n/logn) intervals.) 

The paper ]3j contains history and background on the compactness (or lack of it) of 
interval routing using probabilistic proof methods. To the best of our knowledge, one 
of the authors of that paper, Jan van Leeuwen, was the first to formulate explicitly the 
question of what exactly is the minimal size of the routing functions, and he recently 
drew also our attention to this group of problems. 

2. Kolmogorov Complexity. The Kolmogorov complexity, ||, of x is the 

length of the shortest effective description of x. That is, the Kolmogorov complex- 
ity C{x) of a finite string x is simply the length of the shortest program, say in 
FORTRAN (or in Turing machine codes) encoded in binary, which prints x without 
any input. A similar definition holds conditionally, in the sense that C(x\y) is the 
length of the shortest binary program which computes x given y as input. It can be 
shown that the Kolmogorov complexity is absolute in the sense of being independent 
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of the programming language, up to a fixed additional constant term which depends 
on the programming language but not on x. We now fix one canonical programming 
language once and for all as reference and thereby CQ. 

For the theory and applications, see |J. Let x,y, z E J\f, where N denotes the nat- 
ural numbers. Identify Af and {0, 1}* according to the correspondence (0, e), (1, 0), (2, 
(3, 00), (4, 01), . . .. Hence, the length \x\ of x is the number of bits in the binary string 
x. Let Ti,T2, ... be a standard enumeration of all Turing machines. Let (■, •) be a 
standard invertible effective bijection from JV x N to J\f. This can be iterated to 
«•,•),•)• 

Definition 2.1. Let U be an appropriate universal Turing machine such that 
U(((i,p),y)) = Tj((p, y)) for all i and {p,y}. The Kolmogorov complexity of x given y 
(for free) is 

C(x\y) = mm{\p\ : U((p,y)) = x,p e {0, 1}*}. 



2.1. Kolmogorov random graphs. One way to express irregularity or ran- 
domness of an individual network topology is by a modern notion of randomness like 
Kolmogorov complexity. A simple counting argument shows that for each y in the 
condition and each length n there exists at least one x of length n which is incom- 
pressible in the sense of C{x\y) > n, 50% of all x's of length n is incompressible but for 
1 bit (C(x\y) > n—l), 75% of all :r's is incompressible but for 2 bits (C(x\y) > n — 2) 
and in general a fraction of 1 — 1/2 C of all strings cannot be compressed by more than 
c bits, @. 

Definition 2.2. Each labeled graph G — (V, E) on n nodes V = {1, 2, . . . ,n} can 
be coded by a binary string E(G) of length n(n — l)/2. We enumerate the n(n — l)/2 
possible edges (u, v) in a graph on n nodes in standard lexicographical order without 
repetitions and set the ith bit in the string to 1 if the i-th edge is present and to 
otherwise. Conversely, each binary string of length n(n — l)/2 encodes a graph on n 
nodes. Hence we can identify each such graph with its corresponding binary string. 
We define the high complexity graphs in a particular family Q of graphs. 

Definition 2.3. A labeled graph G on n nodes of a family Q of graphs has 
randomness deficiency at most 5(n) and is called S(n)-random in Q if it satisfies 

(1) C(E(G)\n,S,g)>log\g\-5(n). 

In this paper we use Q is the set of all labeled graphs on n nodes. Then, log|C/| = 



n(n — 1)/2, that is, precisely the length of the encoding of Definition 2.2. In the sequel 
we say just l 5(n)-random' with Q understood. Elementary counting shows that a 
fraction of at least 



1 - 1/2^ 



of all labeled graphs on n nodes in Q has that high complexity, || . 

2.2. Self-Delimiting Binary Strings. We need the notion of self-delimiting 
binary strings. 

Definition 2.4. We call x a prefix of y if there is a z such that y — xz. A set 
^4 C {0, 1}* is prefix-free, if no element in A is the prefix of another element in A. A 
1:1 function E : {0,1}* — > {0,1}* (equivalently, E : J\f — > {0,1}*) defines a prefix- 
code if its range is prefix-free. A simple prefix-code we use throughout is obtained by 
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reserving one symbol, say 0, as a stop sign and encoding 

x = l |a;| 0x, 
|2| = 2|ar| + l. 
Sometimes we need the shorter prefix-code x': 

x - ■ I x I X •) 
\x'\ = \x\+2\\og(\x\ + l)~\ +1. 

We call x oi x 1 & self- delimiting version of the binary string x. We can effectively 
recover both x and y unambiguously from the binary strings xy or x'y. For example, 
if xy = 111011011, then x = 110 and y = 11. If xy = 1110110101 then x = 110 
and y = 1. The self-delimiting form x' . . . y'z allows the concatenated binary sub- 
descriptions to be parsed and unpacked into the individual items x, . . . , y, z; the code 
x' encodes a separation delimiter for x, using 2 flog( |x | + 1)] extra bits, and so on. 

2.3. Topological Properties of Kolmogorov Random Graphs. High com- 
plexity labeled graphs have many specific topological properties which seems to con- 
tradict their randomness. However, randomness is not 'lawlessness' but rather enforces 
strict statistical regularities. For example, to have diameter exactly two. Note that 
randomly generated graphs have diameter two with high probability. In another pa- 
per H| two of us explored the relation between high probability properties of random 
graphs and properties of individual Kolmogorov random graphs. For this discussion it 
is relevant to mention that, in a precisely quantified way, every Kolmogorov random 
graph individually possesses all simple properties which hold with high probability 
for randomly generated graphs. 

Lemma 2.5. The degree d of every node of a 5(n) -random labeled graph on n 
nodes satisfies 

\d-(n- 1)/2| = O ( V (<*(")+ log . 

Proof. Assume that there is a node such that the deviation of its degree d from 
(n — l)/2 is greater than k, that is, \d — (n — l)/2 > k. From the lower bound 
on C(E(G)\n,5,G) corresponding to the assumption that G is random in Q, we can 
estimate an upper bound on k, as follows: 

In a description of G = (V, E) given n, S we can indicate which edges are inci- 
dent on node i by giving the index of the interconnection pattern (the characteristic 
sequence of the set Vi = {j G V — {i} : (i,j) £ E} in n — 1 bits where the jth bit is 1 
if j G Vi and otherwise) in the ensemble of 

(2) m = ^ rf - 2 "^ 2fe2/3( " _1) 

|d-(n-l)/2|>fc ^ ' 

possibilities. The last inequality follows from a general estimate of the tail probability 
of the binomial distribution, with s n the number of successful outcomes in n experi- 
ments with probability of success p = |. Namely, by Chernoff's bounds, in the form 
used in @,|, 

(3) Pr(|s„ - pn\ >k)< 2e- k ^ 3pn . 

To describe G it then suffices to modify the old code of G by prefixing it with 
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(i) A description of this discussion in 0(1) bits; 

(ii) the identity of node i in [log(n + 1)] bits; 

(hi) the value of d in |~log(n+ 1)] bits, possibly adding nonsignificant O's to pad 
up to this amount; 

(iv) the index of the interconnection pattern in logm bits (we know n, fc and 
hence logm); followed by 

(v) the old code for G with the bits in the code denoting the presence or absence 
of the possible edges that are incident on node i deleted. 

Clearly, given n we can reconstruct the graph G from the new description. The total 
description we have achieved is an effective program of 

logm + lognfc + n(n — l)/2 — n + 0(1) 

bits. This must be at least the length of the shortest effective binary program, which 
is C(E(G)\n, S, Q), satisfying Equation |l|. Therefore, 

logm > n — lognfc — 0(1) — 5(n). 

Since we have estimated in Equation ^| that 

logm < n - (2fc 2 /3(n - l))loge, 



it follows that k < ^\{5{n) + lognfc + 0(1)) (n - l)/loge. □ 

Lemma 2.6. Every o(n) -random labeled graph on n nodes has diameter 2. 
Proof. The only graphs with diameter 1 are the complete graphs which can be 

described in 0(1) bits, given n, and hence are not random. It remains to consider 

G = (V, E) is an o(n)-random graph with diameter greater than 2. which contradicts 

Eq. (|lj) from some n onwards. 

It remains to consider G = (V, E) is an o(n)-random graph with diameter greater 

than 2. Let i, j be a pair of nodes with distance greater than 2. Then we can describe 

G by modifying the old code for G as follows: 

(i) A description of this discussion in O(l) bits; 

(ii) The identities of i < j in 21ogn bits; 

(iii) The old code E(G) of G with all bits representing presence or absence of an 
edge (j, k) between j and each k with (z, fc) s E deleted. We know that all the bits 
representing such edges must be since the existence of any such edge shows that 
(i, fc), (fc, j) is a path of length 2 between i and j, contradicting the assumption that i 
and j have distance > 2. This way we save at least n/4 bits, since we save bits for as 
many edges (j, fc) as there are edges (i, fc), that is, the degree of i, which is n/2 ± o(n) 
by Lemma 



2.5 



Since we know the identities of i and j and the nodes adjacent to i (they are in the 
prehx of code E(G) where no bits have been deleted), we can reconstruct G from this 



discussion and the new description, given n. Since by Lemma 2.5 the degree of i is at 
least n/4, the new description of G, given n, requires at most 

n(n - l)/2 - n/4 + 0(log n) 

bits, which contradicts Equation |l| for large n. □ 

Lemma 2.7. Let c > be a fixed constant and let G is a clogn-random labeled 
graph. Then from each node i all other nodes are either directly connected to i or are 
directly connected to one of the least (c + 3) logn nodes directly adjacent to i. 
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Proof. Given i, let A be the set of the least (c + 3) logn nodes directly adjacent 
to i. Assume by way of contradiction that there is a node k of G that is not directly 
connected to a node in Aljji}. We can describe G as follows: 

(i) A description of this discussion in O(l) bits; 

(ii) A literal description of i in log n bits; 

(iii) A literal description of the presence or absence of edges between i and the 
other nodes in n — 1 bits; 

(iv) A literal description of k and its incident edges in log n+n — 2 — (c+3) log n 

bits; 

(v) The encoding E(G) with the edges incident with nodes i and k deleted, 
saving at least 2n — 2 bits. 

Altogether the resultant description has 

n(n - l)/2 + 21ogn + 2n - 3 - (c + 3) logn -2n + 2 

bits, which contradicts the c log n-randomness of G by Equation |l]. □ 

In the description we have explicitly added the adjacency pattern of node i, which 
we deleted later again. This zero-sum swap is necessary to be able to unambiguously 
identify the adjacency pattern of i in order to reconstruct 67. Since we know the 
identities of i and the nodes adjacent to i (they are the prefix where no bits have been 
deleted), we can reconstruct G from this discussion and the new description, given n. 

3. Upper Bounds. We give methods to route messages over Kolmogorov ran- 
dom graphs with compact routing schemes. Specifically we show that in general (on 
almost all graphs) one can use shortest path routing schemes occupying at most 0(n 2 ) 
bits. If one can relabel the graph in advance, and if nodes know their neighbours, 
shortest path routing schemes are shown to occupy only 0(n log 2 n) bits. Allow- 
ing stretch factors larger than one reduces the space requirements — to 0(n) bits for 
stretch factors of O(logn). 



Let G be an 0(logn)-random labeled graph on n nodes. By Lemma 2.7 we know 



that from each node u we can shortest path route to each node v through the least 



O(logn) directly adjacent nodes of u. By Lemma 2.6, G has diameter 2. Once the 
message reached node v its destination is either node v or a direct neighbor of node 
v (which is known in node v by assumption). Therefore, routing functions of size 
0(n log logn) bits per node can be used to do shortest path routing. However, we can 
do better. 

Theorem 3.1. Let G be an O (logn.) -random labeled graph on n nodes. Assume 
that the port assignment may be changed or nodes know their neighbours (IB V II). 
Then, for shortest path routing it suffices to have local routing functions stored in 3n 
bits per node. Hence the complete routing scheme is represented by in 2 bits. 



Proof. Let G be as in the statement of the theorem. By Lemma 2.7 we know 



that from each node u we can route via shortest paths to each node v through the 



O(logn) directly adjacent nodes of u that have the least indexes. By Lemma 2.6, G 
has diameter 2. Once the message has reached node v its destination is either node v 
or a direct neighbor of node v (which is known in node v by assumption). Therefore, 
routing functions of size 0(n log logn) can be used to do shortest-path routing. We 
can do better than this. 

Let Aq C V be the set of nodes in G which are not directly connected to u. Let 



V\, . . . , v m be the 0(log n) least indexed nodes directly adjacent to node u (Lemma 2.7 ) 
through which we can shortest path route to all nodes in Aq. For t — 1,2 ... ,1 define 
A t — {w e A -\J t s l\ A s : (v t ,w) G E}. Let m = \A \ and define m t +i = m t —\A t +i\. 
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Let I be the first t such that m t < nj log log n. Then we claim that v t is connected 
by an edge in E to at least 1/3 of the nodes not connected by edges in E to nodes 
u,vt, ...,V t -l. 

Claim 1. \A t \ > m t -i/3 for 1 <t <l. 

Proof. Suppose, by way of contradiction, that there exists a least t < I such that 
\\At \ — m t -i/2\ > mt-i/6. Then we can describe G, given n, as follows. 

(i) This discussion in 0(1) bits; 

(ii) Nodes u, Vt in 2 logn bits, padded with 0's if need be; 

(hi) The presence or absence of edges incident with nodes u, v\, . . . , Vt—i in r = 
n — l + -" + n — (t — l) bits. This gives us the characteristic sequences of Aq, . . . , At-i 
in V , where a characteristic sequence of A in V is a string of \V\ bits with, for each 
v G V, the vth bit equals 1 if v € A and the vth bit is otherwise; 

(iv) A self-delimiting description of the characteristic sequence of A t in Aq — 
U«=i A s , using Chernoff's bound Equation ^, in at most m f _i — | (i) m t -i loge + 
0(logm 4 _i) bits; 

(v) The description E(G) with all bits corresponding to the presence or absence 
of edges between v t and the nodes in Aq — IJ*Z 1 A s deleted, saving m t _i bits. Further- 
more, we delete also all bits corresponding to presence or absence of edges incident 
with u, vi, . . . , Vt-i saving a further r bits. 

This description of G uses at most 

n(n - l)/2 + O(logn) + m t _i - Qj m t -i loge - m ( _i 

bits, which contradicts the 0(log n)-randomness of G by Equation |l|, because mt-i > 
nj log log n. □ 

Recall that I is the least integer such that mi < nj log log n. We construct the 
local routing function F(u) as follows. 

(i) A table of intermediate routing node entries for all the nodes in Aq in in- 
creasing order. For each node w in [J s=1 A s we enter in the wih position in the table 
the unary representation of the least intermediate node v, with (u,v),(v,w) G E, 
followed by a 0. For the nodes that are not in (J s =i A s we enter a in their position 
in the table indicating that an entry for this node can be found in the second table. 
By Claim [j], the size of this table is bounded by: 

' 1 /2V -1 °° 1 /2\ s_1 

n + ^ - ( - J sn ^ n + ^2^[^) sn<4n; 

s— 1 ^ ^ s—1 ^ ' 

(ii) A table with explicitly binary coded intermediate nodes on a shortest path 
for the ordered set of the remaining destination nodes. Those nodes had a entry in 
the first table and there are at most mi < nj log log n of them, namely the nodes in 
A)~ Us=i -^s- Each entry consists of the code of length loglogn+0(l) for the position 



in increasing order of a node out of v%, . . . ,v m with m = O(logn) by Lemma 2.1 . 
Hence this second table requires at most 2n bits. 

The routing algorithm is as follows. The direct neighbors of u are known in node u 
and are routed without routing table. If we route from start node u to target node w 
which is not directly adjacent to u, then we do the following. If node w has an entry 
in the first table then route over the edge coded in unary, otherwise find an entry for 
node w in the second table. 
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Altogether, we have ^ 6n. Adding another n— 1 in case the port assignment 

may be chosen arbitrarily, this proves the theorem with 7n instead of 6n. Slightly 
more precise counting and choosing I such that mi is the first such quantity < n/ logn 
shows \F(u)\ < 3n. □ 

If we allow arbitrary labels for the nodes, then shortest path routing schemes of 
0(n log 2 n) bits suffice on Kolmogorov random graphs, as witnessed by the following 
theorem. 

Theorem 3.2. Let c > be a constant and let G be a clogn-random labeled gra- 
ph on n nodes. Assume that nodes know their neighbours and nodes may be arbitrarily 
relabeled (II A j), and we allow the use of labels of (1 + (c+ 3) logn) logn bits. Then 
we can shortest path route with local routing functions stored in 0(1) bits per node 
(hence the complete routing scheme is represented by (c + 3)nlog 2 n + nlogn + 0(n) 
bits). 



Proof. Let c and G be as in the statement of the theorem. By Lemma 2.7 we 
know that from each node u we can shortest path route to each node w t hrou gh the 
first (c + 3) logn directly adjacent nodes f(u) = Vi, . . . , v m of u. By lemma 2J3, G has 
diameter 2. Relabel G such that the label of node u equals u followed by the original 
labels of the first (c + 3) logn directly adjacent nodes f(u). This new label occupies 
(1 + (c + 3) logn) logn bits. To route from source u to destination v do the following. 

If v is directly adjacent to u we route to v in 1 step in our model (nodes know their 
neighbours). If v is not directly adjacent to it, we consider the immediate neighbours 
f(v) contained in the name of v. By Lemma |2.7| at least one of the neighbours of u 
must have a label whose original label (stored in the first log n bits of its new label) 
corresponds to one of the labels in f(v). Node u routes the message to any such 
neighbour. This routing function can be stored in O(l) bits. □ 

Without relabeling routing using less than 0(n 2 ) bits is possible if we allow stretch 
factors larger than 1. The next three theorems clearly show a trade-off between the 
stretch factor and the size of the routing scheme. 

Theorem 3.3. Let c > be a constant and let G be a clogn-random labeled 
graph on n nodes. Assume that nodes know their neighbours (II). For routing with 
any stretch factor > 1 it suffices to have n — 1 — (c + 3) logn nodes with local routing 
functions stored in at most |~log(n + 1)] bits per node, and 1 + (c + 3) logn nodes with 
local routing functions stored in 3n bits per node (hence the complete routing scheme 
is represented by less than (3c + 20)nlogn bits). Moreover, the stretch is at most 1.5. 

Proof. Let c and G be as in the statement of the theorem. By Lemma 2.7 we know 
that from each node u we can shortest path route to each node w through the first 
(c + 3) logn directly adjacent nodes v±, . . . ,v m of u. By Lemma 2.6, G has diameter 2. 
Consequently, each node in V is directly adjacent to some node in B = {u, v\, . . . , v m }. 
Hence, it suffices to select the nodes of B as routing centers and store, in each node 
w G B, a shortest path routing function F(w) to all other node s, o ccupying 3n bits 
(the same routing function as constructed in the proof of Theorem 3T if the neighbours 
are known). Nodes v G V — B route any destination unequal to their own label to 
some fixed directly adjacent node w G B. Then l-FXu) < [log(n + 1)] + 0(1), and 
this gives the bit count in the theorem 

To route from a originating node v to a target node w the following steps are 
taken. If w is directly adjacent to v we route to w in 1 step in our model. If w is not 
directly adjacent to v then we first route in 1 step from v to its directly connected 
node in B, and then via a shortest path to w. Altogether, this takes either 2 or 3 
steps whereas the shortest path has length 2. Hence the stretch factor is at most 1.5 
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which for graphs of diameter 2 (i.e., all c log n-random graphs by Lemma 2.6) is the 
only possibility between stretch factors 1 and 2. This proves the theorem. □ 

Theorem 3.4. Let c > be a constant and let G be a clogn-random labeled 
graph on n nodes. Assume that the nodes know their neighbours (II). For routing 
with stretch factor 2 it suffices to have n — 1 nodes with local routing functions stored 
in at most log log n bits per node and 1 node with its local routing function stored in 
3n bits (hence the complete routing scheme is represented by n log log n + 3n bits). 

Proof. Let c and G be as in the statement of the theorem. By Lemma 2.6, G has 
diameter 2. Therefore the following routing scheme has stretch factor 2. Let node 1 
store a shortest path routing function. All other nodes only store a shortest path to 
node 1 . To route from a originating node v to a target node w the following steps are 
taken. If w is an immediate neighbour of v, we route to w in 1 step in our model. If 
not, we first route the message to node 1 in at most 2 steps, and then from node 1 
through a node v to node w in again 2 steps. Because node 1 stores a shortest path 
routing function, either v — w or w is a direct neighbour of v. 

Node 1 can store a shortest path routing function in at most 3n bits using the 



same construction as used in the proof of Theorem 3.1 (if the neighbours are known) 



The immediate neighbours of 1 either route to 1 or directly to the destination of the 
message. For these nodes, the routin g fu nction occupies 0(1) bits. For nodes v at 
distance 2 of node 1 we use Lemma [2.7| , which tells us that we can shortest path 
route to node 1 through the first (c + 3) logn directly adjacent nodes of v. Hence, to 
represent this edge takes loglogra + log(c + 3) bits and hence the local routing function 
F(v) occupies at most log log n + 0(1) bits. □ 

Theorem 3.5. Let c > be a constant and let G be a clogn-random labeled 
graph on n nodes. Assume that nodes know their neighbours (II). For routing with 
stretch factor (c + 3)logn it suffices to have local routing functions stored in 0(1) bits 
per node (hence the complete routing scheme is represented by O(n) bits). 

Proof. Let c and G be as in the statement of the theorem. From Lemma 2/7 
we know that from each node u we can shortest path route t o ea ch node v through 
the first (c + 3) logn directly adjacent nodes of u. By Lemma 2J5, G has diameter 2. 
So the local routing function — representable in 0(1) bits — is to route directly to 
the target node if it is a directly adjacent node, otherwise to simply traverse the first 
(c + 3) logn incident edges of the starting node and look in each of the visited nodes 
whether the target node is a directly adjacent node. If so, the message is forwarded 
to that node, otherwise it is returned to the starting node for trying the next node. 
Hence each message for a destination at distance 2 traverses at most 2(c + 3) logn 
edges. 

Strictly speaking we do not use routing tables at all. We use the fact that a 
message can go back and forth several times to a node. The header of the message 
can code some extra information as a tag "failed." In this case it is possible to 
describe an 0(1) bit size routing function allowing to extract the header from the 
destination without knowing about logn, for example by the use of self-delimiting 
encoding. □ 

Theorem 3.6. Let G be an O(logn) -random labeled graph on n nodes. Assume 
that nodes do not know their neighbors and relabeling and changing the port assignment 
is not allowed (IA A a). Then, for shortest path routing it suffices that each local 
routing function uses (n/2)logn(l + o(l)) bits (hence the complete routing scheme 
uses at most (n 2 /2) logn(l + o(l)) bits to be stored). 

Proof. At each node we can give the neighbors by the positions of the l's in a 
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binary string of length n — 1. Since each node has at most n/2 + o(n) neighbours 



by Lemma 2.5, a permutation of port-assignments to neighbors can have Kolmogorov 
complexity at most (n/2)logn(l + o(l)) |J. This permutation it describes part of 
the local routing function by for each direct neighbour determining the port to route 
messages for that neighbour over. If G is 0(log n)-random then we only require 0(n) 



bits additional routing information in each node by Theorem 3.1. Namely, because 
the assignment of ports (outgoing edges) to direct neighbors is known by permutation 
7r we can use an additional routing table in 3n bits per node to route to the remaining 



non-neighbor nodes as described in the proof of Theorem 3.1. In total this gives 
(n 2 /2) log ra(l + o(l)) bits. □ 

Our last theorem of this section determines the upper bounds for full information 
shortest path routing schemes on Kolmogorov random graphs. 

Theorem 3.7. For full-information shortest path routing on o(n)-random labeled 
graphs on n nodes where relabeling is not allowed (a), the local routing function oc- 
cupies at most n 2 /4 + o(n 2 ) bits for every node (hence the complete routing scheme 
takes at most n 3 /4 + o(n 3 ) bits to be stored). 

Proof. Since for o(n)-random labeled graphs on n the node degree of every node 



is n/2 + o(n) by Lemma 2.5, we can in each source node describe the appropriate 
outgoing edges (ports) for each destination node by the l's in a binary string of 
length n/2 + o(n). For each source node it suffices to store at most n/2 + o(n) such 
binary strings corresponding to the non-neighboring destination nodes. In each node 
we can give the neighbors by the positions of the l's in a binary string of length n — 1. 
Moreover, in each node we can give the permutation of port assignments to neighbors 
in (n/2) logn(l -I- o(l)) bits. This leads to a total of at most (n 2 /4)(l + o(l)) bits per 
node and hence to (n 3 /4)(l + o(l)) bits to store the overall routing scheme. □ 

4. Lower Bounds. The first two theorems of this section together show that 
indeed Q,(n 2 ) bits are necessary to route on Kolmogorov random graphs in all models 
we consider, except for the models where nodes know their neighbours and label 
permutation or relabeling is allowed (II A [3 or II A 7). Hence the upper bound in 



Theorem 3.1 is tight up to order of magnitude. 

Theorem 4.1. For shortest path routing in o{n)-random labeled graphs where 
relabeling is not allowed and nodes know their neighbours (II A a), each local routing 
function must be stored in at least n/2~o(n) bits per node (hence the complete routing 
scheme requires at least n 2 /2 — o(n 2 ) bits to be stored). 

Proof. Let G be an o(ro)-random graph. Let F(u) be the local routing function 
of node u of G, and let |-F(it)| be the number of bits used to stor e F (u). Let E{G) 



be the standard encoding of G in n(n — l)/2 bits as in Definition 2.2. We now give 
another way to describe G using some local routing function F(u). 

(i) A description of this discussion in 0(1) bits; 

(ii) A description of u in exactly logn bits, padded with O's if needed; 

(iii) A description of the presence or absence of edges between u and the other 
nodes in V in n — 1 bits; 

(iv) A self-delimiting description of F(u) in |-F(u)| + 21og|F(u)| bits 

(v) The code E{G) with all bits deleted corresponding to edges (v, w) € E for 
each v and w such that F(u) routes messages to w through the least intermediary 
node v. This saves at least n/2 — o(n ) bits since there are at least n/2 — o(n) node s w 



such that (u, w) E by Lemma 2J3, and since the diameter of G is 2 by Lemma \2.6j 
there is a shortest path (u,v), (v,w) S E 2 for some v. Furthermore, we delete all bits 
corresponding to the presence or absence of edges between u and the other nodes in 
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V, saving another n — 1 bits. This corresponds to the n — 1 bits for edges connected 
to u which we added in one connected block above. 

In the description we have explicitly added the adjacency pattern of node u which 
explicitly added the adjacency pattern of node u which we deleted elswewhere. This 
zero-sum swap is necessary to be able to unambiguously identify the adjacency pattern 
of u in order to reconstruct G given n, as follows: Reconstruct the bits corresponding 
to the deleted edges using u and F{u) and subsequently insert them in the appropriate 
positions of the remnants of E(G). We can do so because these positions can be simply 
reconstructed in increasing order. In total this new description has 

n(n - l)/2 + 0(1) + O(logn) + |F(u)| - n/2 + o{n) 

which must be at least n(n — l)/2 — o(n) by Equation |l|. Hence, >n/2 — o(n), 

which proves the theorem. □ 

Theorem 4.2. Let G be an o(n) -random labeled graph on n nodes. Assume 
that the neighbours are not known (IA V IB) but relabeling is allowed ("/). Then, for 
shortest path routing the complete routing scheme requires at least n 2 /32 — o(n 2 ) bits 
to be stored. 

Proof. In the proof of this theorem we need the following combinatorial result. 
Claim 2. Let k and n be arbitrary natural numbers such that 1 < k < n. Let Xi, 
for 1 < i < k, be natural numbers such that Xi > 1. If ■_ Xi ~ n, then 

k 

y^riogffj] <n - k 

i=l 

Proof. By induction on k. If k = 1, then x\ = n and clearly [logn] < n — 1 if 
n > 1. Supposing the claim holds for k and arbitrary n and Xi, we now prove it for 
k' = k+ 1, n and arbitrary Xi. Let 53<=i x i = n - Then Yli=i Xi = n — Xk> ■ Now 

fe' fc 

^["loga^l =^2\\ogx l ~] + [logZfe'l 

1=1 i=l 

By the induction hypothesis the first term on the right-hand side is less than or equal 
to n — Xk> — k, so 

k' 

2J [logXi] < n - xy - k + [logxfe'l = n - k' + \logx k >] + 1 - x k - 

i=l 

Clearly [logx^./] + 1 < Xk> if Xk> > 1, which proves the claim. □ 

Recall that in model 7 each router must be able to output its own label. Using 
the routing scheme we can enumerate the labels of all nodes. If we cannot enumerate 
the labels of all nodes using less than n 2 /32 bits of information, then the routing 
scheme requires at least that many bits of information and we are done. So assume 
we can (this includes models a and (3 where the labels are not charged for, but can 
be described using logn bits). Let G be an o(n)-random graph. 

Claim 3. Given the labels of all nodes, we can describe the interconnection 
pattern of a node u using the local routing function of node u plus an additional 
n/2 + o(n) bits. 
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Proof. Apply the local routing function to each of the labels of the nodes in turn 
(these are given by assumption). This will return for each edge a list of destinations 
reached over that edge. To describe the interconnection pattern it remains to encode, 
for each edge, which of the destinations reached is actually its immediate neighbour. 
If edge i routes X{ destinations, this will cost [logXj] bits. By Lemma |2~5| the degree 
of a node in G is at least n/2 — o(n). Then in total, Y?a=i~ [loga^] bits will be 
sufficient; separations need not be encoded because they can be determined using the 
knowledge of all Xi's. Using Claim |] finishes the proof. □ 

Now we show that there are n/2 nodes in G whose local routing function requires 
at least n/8 — 31ogn bits to describe (which implies the theorem). 

Assume, by way of contradiction, that there are n/2 nodes in G whose local 
routing function requires at most n/8 — 3 logn bits to describe. Then we can describe 
G as follows: 

(i) A description of this discussion in O(l) bits, 

(ii) The enumeration of all labels in at most n 2 /32 (by assumption), 

(iii) A description of the n/2 nodes in this enumeration in at most n bits, 

(iv) The interconnection patterns of these n/2 nodes in n/8 — 3 logn plus n/2 + 
o(n) bits each (by assumption, and using Claim ^). This amounts to n/2(5n/8 — 
3 logn) + o(n 2 ) bits in total, with separations encoded in another nlogn bits, 

(v) The interconnection patterns of the remaining n/2 nodes only among them- 
selves using the standard encoding, in l/2(n/2) 2 bits. 

This description altogether uses 

0(1) + n 2 /32 + n + n/2(5n/8 - 3 logn) + 

+o(n 2 ) + nlogn + l/2(n/2) 2 = 

= n 2 /2-n 2 /32 + n + o(n 2 ) - n/2 logn 

bits, contradicting the o(n)-randomness of G by Eq. (|l|). We conclude that on at least 
n/2 nodes a total of n 2 /16 — o(n 2 ) bits are used to store the routing scheme. □ 
If neither relabeling nor changing the port assignment is allowed, the next the- 
orem implies that for shortest path routing on almost all such 'static' graphs one 
cannot do better than storing part of the routing tables literally, in (n 2 /2) logn bits. 
Note that it is known || that there are worst-case graphs (even in models where 
relabeling is allowed) such that n 2 logn — 0(n 2 ) bits are required to store the routing 
scheme, and this matches the trivial upper bound for all graphs exactly. But in our 



Theorem 4.3 we show that in a certain restricted model for almost all graphs asymp- 
totically (n 2 /2) logn bits are required and by Theorem |3.6| that many bits are also 
sufficient. 

Theorem 4.3. Let G be an o(n)-random labeled graph on n nodes. Assume that 
nodes do not know their neighbors and relabeling and changing the port assignment 
is not allowed (IA A a). Then, for shortest path routing each local routing function 
must be stored in at least (n/2) logn — O(n) bits per node (hence the complete routing 
scheme requires at least (n 2 /2)logn — 0(n 2 ) bits to be stored). 

Proof. If the graph cannot be relabeled and the port-assignment cannot be 
changed, the adversary can set the port-assignment of each node to correspond to 
a permutation of the dest ination nodes. Since each node has at least n/2 — o(n) 



neighbours by Lemma 2^5, such a permutation can have Kolmogorov complexity as 
high as (n/2) logn — 0(n) ||. Because the neighbours are not known, the local rout- 
ing function must for each neighbor node determine the port to route messages for 
that neighbor node over. Hence the local routing function completely describes the 
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permutation, given the neighbors, and thus it must occupy at least (n/2) logn — 0(n) 



Note that in this model (IA A a) we can trivially find by the same method a lower 
bound of n 2 logn — 0(n 2 ) bits for specific graphs like the complete graph and this 
matches exactly the trivial upper bound in the worst case. However, Theorem |4.3| 
shows that for this model the for almost all labeled graphs asymptotically 50% of this 
number of bits of total routing information is both necessary and sufficient. 

Even if stretch factors between 1 and 2 are allowed, the next theorem shows that 
Q,(n 2 logn) bits are necessary to represent the routing scheme in the worst case. 

Theorem 4.4. For routing with stretch factor < 2 in labeled graphs where relabel- 
ing is not allowed (a), there exist graphs on n nodes (almost (n/3)! such graphs) where 
the local routing function must be stored in at least (n/3) logn — 0{n) bits per node at 
n/3 nodes (hence the complete routing scheme requires at least (n 2 /9)logn — 0(n 2 ) 
bits to be stored). 

Proof. Consider the graph Gk with n = 3k nodes depicted in Figure [[]. Each 
node Vi in v^+i, ■ ■ . , «2fc is connected to Vi+k and to each of the nodes v%, . . . , Vk- Fix a 
labeling of the nodes Vi, . . . , v^k with labels from {1, . . . , 2k}. Then any labeling of the 
nodes v 2 k+i, ■ ■ ■ , «3fc with labels from {2k + 1, . . . , 3k} corresponds to a permutation 
of {2k + 1, . . . , 3A;} and vice versa. 

Clearly, for any two nodes V{ and Vj with 1 < i < k and 2k + 1 < j < 3fc, 
the shortest path from Vi to Vj passes through node Wj-fc and has length 2, whereas 
any other path from Vi to Vj has length at least 4. Hence any routing function on 
Gk with stretch factor < 2 routes such Vj from Vi over the edge (vi,Vj~k)- Then at 
each of the k nodes v\, . . . ,Vk the local routing functions corresponding to any two 
labclings of the nodes «2fc+i, ■ ■ ■ , V3k are different. Hence each representation of a local 
routing function at the k nodes Vi, 1 < i < k, corresponds one-one to a permutation 
of {2k + 1, . . . , 3k}. So given such a local routing function we can reconstruct the 
permutation (by collecting the response of the local routing function for each of the 
nodes k + 1, ... ,3k and grouping all pairs reached over the same edge). The number 
of such permutations is k\. A fraction at least 1 — l/2 fc of such permutations ir has 
Kolmogorov complexity C(tt) — fclogfc — 0(k) ||. Because tt can be reconstructed 
given any of the k local routing functions, these k local routing functions each must 
have Kolmogorov complexity fclogfc — O(k) too. This proves the theorem for n is a 
multiple of 3. For n = 3k — 1 or n = 3k — 2 we can use Gk, dropping Vk and Vfc-i- 



bits per node. 



□ 
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Note that the proof only requires that there be no relabeling; apart from that the 
direct neighbors of a node may be known and ports may be reassigned. 

By the above calculation there are at least (1 — l/2™/ 3 )(n/3)! labeled graphs on 
n nodes for which the theorem holds. □ 

Our last theorem shows that for full information shortest path routing schemes 
on Kolmogorov random graphs one cannot do better than the trivial upper bound. 

Theorem 4.5. For full-information shortest path routing on o(n)-random labeled 
graphs on n nodes where relabeling is not allowed (a), the local routing function oc- 
cupies at least n 2 /4 — o(n 2 ) bits for every node (hence the complete routing scheme 
requires at least n 3 /4 — o(n 3 ) bits to be stored). 

Proof. Let G be a graph on nodes {1,2,..., n} satisfying Eq. (|l]) with S(n) — o(n). 
Then we know that G satisfies Lemmas 2J^, 2j| Let F(u) be the local routing function 



of node u of G, and let be the number of bits used to encode F(u). Let E(G) 



be the standard encoding of G in n(n — l)/2 bits as in Def. 2.2. We now give another 
way to describe G using some local routing function F(u). 

(i) A description of this discussion in 0(1) bits. 

(ii) A description of u in logn bits. (If it is less pad the description with O's.) 

(iii) A description of the presence or absence of edges between u and the other 
nodes in V in n — 1 bits. 

(iv) A description of F(u) in \F(u)\ + 0(log|F(u)|) bits (the logarithmic term 
to make the description self-delimiting). 

(v) The code E{G) with all bits deleted corresponding to the presence or absence 
of edges between each w and v such that v is a neighbour of u and w is not a neighbour 
of u. Since there are at least n/2 — o(n) nodes w suc h that (u, w) ^ E and at least 
n/2—o(n) nodes v such that (u, v) <E E, by Lemma [?~5| , this saves at least (n/2 — o(n)) 2 
bits. 

From this description we can reconstruct 67, given n, by reconstructing the bits cor- 
responding to the deleted edges from u and F(u) and subsequently inserting them in 
the appropriate positions to reconstruct E(G). We can do so because F(u) represents 
a full information routing scheme implying that (v, w) G E iff (u,v) is among the 
edges used to route from u to w. In total this new description has 

n(n - l)/2 + O(logn) + \F(u)\ - n 2 /4 + o{n 2 ) 

which must be at least n{n — l)/2 — o(n) by Eq. ([!]). We conclude that = 
n 2 /4 — o(n 2 ), which proves the theorem. 

Note that the proof only requires that there be no relabeling; apart from that the 
direct neighbors of a node may be known and ports may be reassigned. □ 

5. Average Case. What about the average cost, taken over all labeled graphs of 
n nodes, of representing a routing scheme for graphs over n nodes? The results above 
concerned precise overwhelmingly large fractions of the set of all labeled graphs. The 
numerical values of randomness deficiencies and bit costs involved show that these 
results are actually considerably stronger than the corresponding average case results 
which are straightforward. 

Definition 5.1. For each labeled graph G, let T S {G) be the minimal total 
number of bits used to store a routing scheme of type S (where S indicates shortest 
path routing, full-information routing, and the like.). The average minimal total 
number of bits to store a routing scheme for S'-routing over labeled graphs on n nodes 
is Ts{G)/2 n ^ n ~ 1 ^ 2 with the sum taken over all graphs G on nodes {1,2, . . . ,n}. 
(That is, the uniform average over all the labeled graphs on n nodes.) 
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The results on Kolmogorov random graphs above have the following corollaries. 
The set of (3 log n)-random graphs constitutes a fraction of at least (1 — 1/n 3 ) of the 
set of all graphs on n nodes. The trivial upper bound on the minimal total number 
of bits for all routing functions together is 0{n 2 log n) for shortest path routing on 
all graphs on n nodes (or 0(n 3 ) for full- information shortest path routing). Simple 
computation shows that the average total number of bits to store the routing scheme 
for graphs of n nodes is (asymptotically and ignoring lower order of magnitude terms 
as in Table g): 



1. < 3n for shortest path routing in model IB V II (Theorem |3.l| ) 



2. < 6nlog 2 n for shortest path routing in model II A 7 where the average is 
taken over the initially labeled graphs on n nodes with labels in {1, 2, . . . , n} before 
they were relabeled with new and longer labels giving routing information (Theo- 



rem 3.2) 



3. < 38nlogn for routing with any stretch factor s for 1 < s < 2 in model II 



(Theorem 3.3); 



4. < nloglogn for routing with stretch factor 2 in model II (Theorem 3.4); 

5. 0(n) for routing with stretch factor 61ogn in model II (Theorem |3.5| with 



3); 



Theorem 4.3); 



6. > n 2 /2 for shortest path routing in model a (Theorem 4.1); 

7. > n 2 /32 for shortest path routing in model IA and IB (under all relabeling 
conventions, Theorem p~2] ); 

(n 2 /2) logn for shortest path routing in model IA A a (Theorem |3.6| and 
) 

9. = n 3 /4 for full information shortest path routing in model a (Theorem 3.7 
and Theorem |4.5| ) . 

6. Conclusion. The space requirements for compact routing for almost all la- 
beled graphs on n nodes, and hence for the average case of all graphs on n nodes, is 
conclusively determined in this paper. We introduce a novel application of the incom- 
pressibility method. The next question arising in compact routing is the following. 
For practical purposes the class of all graphs is too broad in that most graphs have 
high node degree (around n/2). Such high node degrees are unrealistic in real commu- 
nication networks for large n. The question arises to extend the current treatment to 
almost all graphs on n nodes of maximal node degree d where d ranges from O(l) to n. 
Clearly, for shortest path routing 0(n 2 \ogd) bits suffice, and || showed that for each 
d < n there are shortest path routing schemes that require a total of fl(n 2 \ogd) bits 
to be stored in the worst case for some graphs with maximal degree d, where we allow 
that nodes are relabeled by permutation and the port-assignment may be changed 
(IB A j3). This does not hold for average routing since by our Theorem |3.l| 0(n 2 ) 
bits suffice for d = O(n). (Trivially, 0(n 2 ) bits suffice for routing in every graph with 
d = O(l).) We believe it may be possible to show by an extension of our method that 
Q(n 2 ) bits (independent of d) are necessary and sufficient for shortest path routing in 
almost all graphs of maximum node degree d, provided d grows unboundedly with n. 

Another research direction is to resolve the questions addressed in this paper for 
Kolmogorov random unlabeled graphs in particular with respect to the free relabeling 
model (insofar as they do not follow a fortiori from the results presented here). 
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